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(57) A speech encoding method includes generat- 
ing information representing characteristics of a synthe- 
sis fitter (120), and generating an excitation signal for 
exciting the synthesis filter, the excitation signal includ- 
ing a pulse train generated by setting one or more 
pulses at a predetermined number of pulse positions 
selected from a plurality of pulse position candidates 
adaptively changed in accordance with the characteris- 
tics of the speech signal. A speech decoding method 
includes inputting the excitation signal to a synthesis fil- 
ter for reconstructing a speech signal. 
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Description 

[0001] The present invention relates to an encod- 
ing/decoding method of a low bit rate used for digital tel- 
ephone, voice memo, etc. s 
[0002] In recent years, the encoding techniques have 
found wide applications in the portable telephone or the 
internet in which the speech and music sound are trans- 
mitted and stored by being compressed at a low bit rate. 
Such techniques include the CELP method (Code 10 
Excited Linear Prediction (M.R.Schroeder and B.S. at 
al). "Code Excited Linear Prediction (CELP): High Qual- 
ity Speech at Very Low Bit Rates", Proc. ICASSP, 
pp.937-940, 1985 (reference 1) and W.S.KIeijin, 
DJ.Krasinski et al. "Improved Speech Quality and Effi- is 
cient Vector Quantization in SELP". Proc. ICASSP. 
pp.155-158, 1988 (reference 2)). 
[0003] The CELP is an encoding scheme based on 
the linear predictive analysis. An input speech signal is 
divided into a linear prediction coefficient representing 20 
the phoneme information and a prediction residual sig- 
nal representing the sound level, etc. according to the 
linear predictive analysis. Based on the linear predictive 
coefficients, a recursive digital filter called a synthesis 
fitter is configured, and supplied with a prediction resid- ss 
ual signal as an excitation signal thereby to restore the 
original input speech signal. 

[0004] For encoding at low bit rate, it is necessary to 
encode, with as low bit rates as possible, the linear pre- 
dictive coefficients constituting the synthesis fitter infer- 30 
mation representing the characteristics of the synthesis 
filter and the prediction residual signal constituting the 
characteristic of the synthetic filter. In the CELP 
scheme, two types of signal including the pitch vector 
and the noise vector are each multiplied by an appropri- 35 
ate gain and added to each other thereby to generate 
an excitation signal in the form encoded from the predic- 
tion residual signal. A method of generating the pitch 
vector is described in detail in reference 2 for example. 
There is proposed a method of using a fixed coded vec- 40 
tor on a rising portion (onset portion) of a speech other 
than the method of the reference 2. However, in the 
present invention, such vectors are used as pitch vec- 
tors. 

[0005] The noise vector is normally generated by stor- 45 
ing a multiplicity of candidates in a stochastic codebook 
and selecting an optimum one. In a method of searching 
for a noise vector, all the noise vectors are added to the 
pitch vector and then a synthesis speech signal is gen- 
erated through a synthetic filter. The error of this synthe- so 
sis speech signal with respect to the input signal is 
evaluated thereby to select a noise vector generating a 
synthesis speech signal with the smallest error. What is 
most important for the CELP scheme, therefore, is how 
efficiently to store the noise vectors in the stochastic ss 
codebook. 

[0006] The algebraic codebook (J-RAdoul et al. "Fast 
CELP Coding based on algebraic codes", Proc. 



ICASSP '87, pp.1957-1960 (reference 3)) has a simple 
structure in which the noise vector is indicated only by 
the presence or absence of a pulse and the sign (+, -) 
thereof. The algebraic codebook, as compared with the 
stochastic codebook with a plurality of noise vectors 
stored therein, need not store any code vector and has 
the feature of a very small calculation amount. Also, the 
sound quality of the system using the algebraic code- 
book is not inferior to that of the prior art. and therefore 
has recently been used for various standard schemes. 
[0007] In the algebraic codebook, however, the dete- 
rioration of the sound quality becomes more conspicu- 
ous with the decrease in the encoding bit rate. One 
reason is the shortage of the pulse position information. 
Specifically, in view of the fact that the algebraic code- 
book algebraically simplifies the positional information 
of the pulse, in spite of the advantage described above, 
position candidates sometimes exist at points where a 
pulse rise is not required for low bit rate encoding but 
not at required points. This not only deteriorates the effi- 
ciency but also deteriorates the sound quality. 
[0008] Another reason for the deterioration of the 
sound quality when using the algebraic codebook is the 
shortage of the number of pulses. The shortage of 
pulses gives rise to a pulse-like noise in the decoded 
speech. This is because an excitation signal is gener- 
ated from a pulse train and the presence or absence of 
a pulse can be easily acknowledged perceptually with 
the decrease in the number of pulses. For improving the 
sound quality, it is necessary to alleviate the pulse-like 
noise. 

[0009] As described above, the conventional algebraic 
codebook has the advantage of a simple structure and 
a small amount of calculation, but poses the problem 
that the quality of the decoded speech is deteriorated 
due to the shortage of the pulses and the positional 
information of the pulse train making up the excitation 
signal for the synthesis filter at a low bit rate. 
[001 0] The object of the present invention is to provide 
a speech encoding/decoding method which can secure 
a superior sound quality even at a low bit rate encoding. 
[0011] According to a first aspect of the invention, 
there is provided a speech encoding method comprising 
the steps of generating at least information representing 
the characteristics of a synthesis fitter for a speech sig- 
nal, and generating an excitation signal for exciting the 
synthesis fitter, including a pulse train generated by set- 
ting pulses at a predetermined number of pulse posi- 
tions selected from the pulse position candidates 
adaptively changed in accordance with the characteris- 
tics of the speech signal. 

[0012] According to another aspect of the invention, 
there is provided a speech decoding method for input- 
ting an excitation signal to a synthesis fitter and decod- 
ing a speech signal, the excitation signal containing a 
pulse train generated by setting pulses at a predeter- 
mined number of pulse positions selected from the 
pulse position candidates adaptively changed in 
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accordance with the characteristics of the speech sig- 
nal. 

[0013] In a speech encoding/decoding method 
according to this invention, the excitation signal for 
exciting the synthesis filter contains a pulse train gener- 
ated by setting pulses at a predetermined number of 
pulse positions selected from the pulse position candi- 
dates adaptively changed in accordance with the char- 
acteristics of the speech signal. More specifically, the 
pulse position candidates are assigned in such a man- 
ner that more candidates exist at a domain of larger 
power of the speech signal. 

[0014] Also, the excitation signal can be configured to 
include a pulse train generated by setting pulses at all 
the pulse position candidates adaptively changing in 
accordance with the characteristics of the voice signal 
and optimizing the amplitude of each pulse with prede- 
termined means. In such a case, more specifically, the 
pulse position candidates are assigned so that more 
candidates exist at a domain of larger power of the voice 
signal. 

[0015] Alternatively, the excitation signal can be gen- 
erated by use of a pulse train generated by setting 
pulses at a predetermined number of pulse positions 
selected from first pulse position candidates changing 
adaptively in accordance with the characteristics of the 
voice signal or a pulse train generated by setting pulses 
at a predetermined number of pulse positions selected 
from second pulse position candidates including a part 
or the whole of the positions not used as the first pulse 
position candidates. In this case, the first pulse position 
candidates are arranged, more specifically, so that more 
candidates exist at a domain that the power of the 
speech signal is larger. 

[0016] Also, in the case where the excitation signal 
includes a pitch vector and a noise vector, the noise 
vector is generated by setting pulses at a predeter- 
mined number of pulse positions selected from the 
pulse position candidates changed in accordance with 
the shape of the pitch vector. More specifically, more 
pulse position candidates are located at a domain of 
larger power of the pitch vector. 

[001 7] Also, the noise vector can be configured by use 
of a pulse train generated by setting pulses at a prede- 
termined number of pulse positions selected from posi- 
tion candidates set based on the position candidate 
density function determined from the shape of the pitch 
vector, tn such a case, the pulse position candidates 
are, more specifically, arranged in such a manner that 
more candidates exist at a place where the value of the 
position candidate density function is larger. The posi- 
tion candidate density function is a function describing 
the relationship between the probability of arranging the 
pulses and the power of the pitch vector. 
[0018] Further, in the case of using a compensation fil- 
ter such as a pitch period emphasis fitter, a modified 
pitch vector is generated from the pitch vector applied 
through a f iter based on this inverse characteristic, and 



the noise vector is generated by setting pulses at a pre- 
determined number of pulse positions selected from the 
pulse position candidates changing in accordance with 
the shape of the inverse correction pitch vector. In such 
5 a case, the pulse position candidates are. more specifi- 
cally, arranged in such a manner that more candidates 
exist at a domain that the power of the inverse correc- 
tion vector is larger. 

[001 9] By adaptively changing the pulse position can- 
w didates in accordance with the characteristics such as 
the power distribution of the speech signal as described 
above, the encoding efficiency is improved even when 
using an algebraic codebook in which the pulse posi- 
tions and the number of pulses are reduced due to the 
is low bit rate. Thus, the bit rate can be reduced while 
maintaining the quality of the decoded speech. Also, 
since the pitch vector is used for producing pulse posi- 
tion candidates, the adaptation of the pulse position 
candidates becomes possible without any additional 
20 information. 

[0020] In another speech encoding/decoding method 
according to this invention, an excitation signal including 
a pitch vector and a noise vector contains a pulse train 
shaped by a pulse shaping filter having the characteris- 
es tics determined based on the shape of the pitch vector. 
[0021] With this configuration, the pulse-like noise 
contained in the decoded speech due to the reduced 
number of pulses is alleviated, and even in the case 
where the pulse positions or the number of pulses is 
30 reduced due to the low bit rate, the bit rate can be 
reduced while maintaining the quality of the decoded 
speech. 

[0022] Further, in a speech encoding/decoding 
method according to this invention, an excitation signal 

35 is generated, including a pulse train generated by set- 
ting pulses at a predetermined number of pulse posi- 
tions selected from the pulse position candidates 
adaptively changed in accordance with the characteris- 
tics of the speech signal. Also, the pulse train can be 

40 shaped by a pulse shaping fitter having a characteristic 
determined based on the shape of the pitch vector. 
[0023] This summary of the invention does not neces- 
sarily describe all necessary features so that the inven- 
tion may also be a sub-combination of these described 

45 features. 

[0024] The invention can be more fully understood 
from the following detailed description when taken in 
conjunction with the accompanying drawings, in which: 

so FIG. 1 is a block diagram showing a speech encod- 
ing system according to a first embodiment of the 
present invention; 

FIG. 2 is a flowchart showing the steps of selecting 
pulse position candidates according to the first 
55 embodiment of the invention ; 

FIGS. 3A, 3B, 3C, 3D, and 3E are diagrams show- 
ing the manner of processing at each step in FIG. 2; 
FIG. 4 is a diagram showing the relation between 
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the power envelope of the pitch vector and the 
pulse position candidates according to the first 
embodiment; 

FIG. 5 is a block diagram showing a speech decod- 
ing system according to the first embodiment; s 
FIG. 6 is a block diagram showing a speech encod- 
ing system according to a second embodiment of 
the invention; 

FIG. 7 is a block diagram showing a speech decod- 
ing system according to the second embodiment; io 
FIG. 8 is a block diagram showing a speech encod- 
ing system according to a third embodiment of the 
invention; 

FIG. 9 is a Mock diagram showing a speech decod- 
ing system according to the third embodiment; is 
FIG. 10 is a block diagram showing a speech 
encoding system according to a fourth embodiment 
of the invention; 

FIGS. 11A to 11C are diagrams representing the 
power envelope of the pitch vector and the position so 
candidate density function and the position candi- 
date density function; 

FIG. 12 is a block diagram showing a speech 
decoding system according to the fourth embodi- 
ment; 25 
FIG. 13 is a block diagram showing a speech 
encoding system according to a fifth embodiment of 
the invention; 

FIG. 14 is a block diagram showing a speech 
decoding system according to the fifth embodiment; 30 
FIG. 15 is a block diagram showing a speech 
encoding system according to a sixth embodiment 
of the invention; 

FIG. 16 is a diagrams for explaining how to form 
noise vectors; and 35 
FIG. 17 is a block diagram showing a speech 
decoding system according to the sixth embodi- 
ment. 

[0025] FIG. 1 shows a speech encoding system using 40 
a speech encoding method according to a first embodi- 
ment. This speech encoding system comprises input 
terminals 101, 106, an LPC analyzer section 110, an 
LPC quantizer section 1 11, a synthesis section 120, a 
perceptually weighting section 130, an adaptive code- 45 
book 141, a pulse position candidate search section 
142, an adaptive algebraic codebook 143. a code selec- 
tor section 150, a pitch enhancement section 160, gain 
multiplier sections 102, 103 and adder sections 104, 
105. 

[0026] The input terminal 1 01 is supplied with an input 
speech signal to be encoded, in units of one-frame 
length, and in synchronism with this input, a linear pre- 
diction analysis is conducted whereby a linear predic- 
tion coefficient (LPC) corresponding to the vocal track ss 
characteristic is determined. The LPC is quantized by 
the LPC quantizer section 111, and the quantization 
value is input to the synthesis section 120 as synthesis 



section information indicating the characteristic of the 
synthesis section 120. The synthesis section 120 usu- 
ally consists of a synthesis filter. An index A indicating 
the quantization value is output as the result of encod- 
ing to a multiplexer section not shown. 
[0027] The adaptive codebook 1 41 has stored therein 
the excitation signals input in the past to the synthesis 
section 120. The excitation signal constituting an input 
to the synthesis section 120 is a prediction residual sig- 
nal quantized in the linear prediction analysis and corre- 
sponds to the glotall source containing the information 
on the sound level or the like. The adaptive codebook 

141 cuts out the waveform in the length corresponding 
to the pitch period from the past excitation signal and by 
repeating this process, generates a pitch vector. The 
pitch vector is normally determined in units of several 
subframes into which a frame is divided. 

[0028] The pulse position candidate search section 

142 determines by calculation the positions at which 
pulse position candidates are set in the subframe based 
on the pitch vector determined by the adaptive code- 
book 141 and outputs the result of the calculation to the 
adaptive algebraic codebook 143. 

[0029] The adaptive algebraic codebook 143 
searches the pulse position candidates input from the 
pulse position candidate search section 142 for a prede- 
termined number of pulse positions and the signs (+ or 
-) thereof in such a manner that the distortion against 
the input speech signal excluding the effect of the pitch 
vector is minimized under the perceptual weight. 
[0030] The pulse train output from the adaptive alge- 
braic codebook 143 is given a periodicity in units of 
pitches by the pitch enhancement section 160 as 
required. The pitch enhancement section 160 usually 
consists of a pitch filter. The pitch enhancement section 
160 is supplied with the information L on the pitch 
period determined by the search of the adaptive code- 
book 1 43 from the input terminal 1 06 and thus the pulse 
train is given a periodicity of the pitch period. 
[0031 ] The pitch vector output from the adaptive code- 
book 141 and the pulse train output from the adaptive 
algebraic codebook 143 and given a periodicity by the 
pitch enhancement section 160 as required are multi- 
plied by the gain GO for the pitch vector and the gain G1 
for the noise vector at the gain multiplier sections 102, 
103, respectively, added to each other at the adder sec- 
tion 104, and applied to the synthesis section 120 as an 
excitation signal. The optimum gains GO, Gl are 
selected from the gain codebook (not shown) which nor- 
mally stores a plurality of gains. 
[0032] The code selector section 1 50 outputs an index 
B indicating the pitch vector selected by the search of 
the adaptive codebook 141. an index C indicating the 
pulse train selected by the search of the adaptive alge- 
braic codebook 1 43. and an index G indicating the gains 
GO, G1 selected by the search of the gain codebook. 
These indexes B, C, G and the index A indicating the 
synthesis filter information constituting the quantization 
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value of the LPC from the LPC quantizer section 1 1 1 
are multiplexed in a multiplexer section not shown and 
transmitted as an encoded stream. 
[0033] Now, an explanation will be given of the pulse 
position candidate search section 1 42 and the adaptive s 
algebraic codebook 143 constituting the features of the 
present embodiment. 

[0034] According to this embodiment, the fact that the 
pulses tend to be set mainly around the sections where 
the power of excitation signal is large is utilized to permit io 
only the bit rate to decrease without deteriorating the 
sound quality. Thus, pulse position candidates are set 
for each subframe in such a manner as to assign more 
position candidates for sections where the power of the 
excitation signal is larger. 75 
[0035] The pitch vector resembles the shape of an 
ideal excitation signal. It is therefore effective to set 
pulse position candidates by the pulse position candi- 
date search section 142 based on the pitch vector 
determined by the search of the adaptive codebook 20 
141. The same pitch vector can be obtained on the 
decoding side as on the encoding side, and therefore it 
is not necessary to generate additional information for 
the adaptation of pulse position candidates. 
[0036] In the case where pulse position candidates 25 
are assigned only at points of large power for the adap- 
tation of the pulse position candidates, the sound quality 
may be deteriorated due to the continuous lack of the 
position candidates in a section of small power. Various 
methods of adaptation of pulse position candidates are 30 
conceivable. The methods described below, for exam- 
ple, make possible the adaptation with a small deterio- 
ration of the sound quality. 

[0037] With reference to the flowchart of FIG. 2, an 
explanation will be given of the steps of adaptation of 35 
pulse position candidates by the pulse position candi- 
date search section 142. FIGS. 3A to 3D show an input 
pitch vector waveform (FO), power (F1 ) of this input pitch 
vector waveform, smoothed power (F2) and an inte- 
grated value (F3) in sample direction of the smoothed 40 
power, each corresponding to the steps of FIG. 2. 
[0038] A similar processing is possible by use of other 
measures indicating the waveform such as an absolute 
value (square root of the power) of the amplitude value 
other than the power. In this embodiment, these meas- 45 
ures are collectively defined as the power. 
[0039] First, the power (F1 ) of FIG. 3B is calculated for 
the input pitch vector (FO) of FIG. 3A (step S1 ), and then 
the power (F1) is smoothed as shown in FIG. 3C 
thereby to produce the smoothed power (F2) (step S2). so 
The power can be smoothed, for example, by a method 
of weighting with a window of several samples and tak- 
ing a moving average. 

[0040] Next, the power smoothed in step S2 is inte- 
grated for each sample (step S3). The manner of this ss 
operation is shown in FIG. 3D. Specifically, let p(n) be 
the smoothed power of the n-th sample, q(n) be the inte- 
grated value of the smoothed power p(n) and L be the 



subframe length. The integrated value q(n) is deter- 
mined as 

q(n) = p(n) + q(n-1) + C (n = 0 L-1) 

where C is a constant for adjusting the degree of the 
density of pulse position candidates. 
[0041 ] Pulse position candidates are calculated using 
this integrated value q(n) (step S4). In this case, the 
integrated value is normalized so that the number of 
position candidates determined by the integrated value 
for the last sample is M. The position of the m-th candi- 
date can be determined as Sm in correspondence with 
the integrated value as shown in FIG. 3D Position can- 
didates in the number of M can be determined by 
repeating this process for m of 0 to M-1 . 
[0042] FIG. 4 shows the relation between the pulse 
candidate positions determined as described above and 
the power of the pitch vector. The solid curve represents 
the power envelope of the pitch vector, and the arrows 
pulse position candidates. As shown in this diagram, the 
pulse position candidates are distributed densely where 
the pitch vector has a large power and progressively 
become coarse according as the power decreases. As 
a result. puJse positions can be selected more accu- 
rately where the power of the pitch vector is large. Also, 
even in the case where the number of pulse position 
candidates decreases due to the low bit rate, the encod- 
ing of high sound quality is possible by concentrating a 
few number of pulse position candidates adaptively at 
points of large power. 

[0043] Next, the position candidates thus determined 
are distributed among channels (step S5). Among vari- 
ous methods of distribution available, the one shown in 
FIG. 3E is desirable in which the position candidates are 
distributed in staggered fashion among the channels. In 
this way, the adaptive algebraic codebook 143 is deter- 
mined. In the search process, the optimum position and 
the sign of a pulse is selected from each of the channels 
(Chi, Ch2, Ch3) in the adaptive algebraic codebook 
143, thereby generating a noise vector made up of three 
pulses. 

[0044] In the case where the subframe length is 80 
samples, for example, substantially no perceptual dete- 
rioration is felt when the above-mentioned method is 
used even if the pulse position candidates are reduced 
to about 40 samples. 

[0045] In the algebraic codebook, the pulse amplitude 
is normally either +1 or -1. Nevertheless, a method has 
been proposed which uses a pulse having amplitude 
information. For example, reference 4 (Chang Deyuan, 
"An 8 kb/s low complexity ACELP speech codec." 1996 
3rd International Conference on Signal Processing, pp. 
671-4. 1996) discloses a method in which the pulse 
amplitude is selected from 1.0. 0.5. 0. -0.5 and -1.0. 
Also, a multi-pulse scheme providing a kind of pulse 
excitation signal configured of a pulse train having an 
amplitude is described in reference 5 (K. Ozawa and T. 
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Araseki, "Low Bit Rate Multi-pulse Speech Coder with 
Natural Speech Quality," IEEE Proc. ICASSP '86. pp. 
457-460, 1986). The present invention is also applicable 
to the case represented by the above-mentioned exam- 
ples in which the pulse has an amplitude. 5 
[0046] Now, a speech decoding system correspond- 
ing to the speech encoding system of FIG. 1 will be 
explained with reference to FIG. 5. 
[0047] The same component parts having the same 
function as the corresponding ones in FIG. 1 will be des- w 
ignated by the same reference numerals, respectively. 
The speech decoding system of FIG. 5 comprises a 
synthesis section 120. a LPC dequantizer section 121, 
an adaptive codebook 141, a pulse position candidate 
search section 142, an adaptive algebraic codebook is 
143, a pitch enhancement section 160, gain multiplier 
sections 102, 103 and an adder section 104. The 
speech decoding system is supplied with an encoded 
stream transmitted from the speech encoding system of 
FIG. 1. 20 
[0048] The encoded stream thus input is applied to a 
demultiplexer section 121 not shown, and output after 
being demultiplexed by the demultiplexer section 121 
into the index A of the synthesis fitter information 
described above, the index B indicating the pitch vector 25 
selected by the search of the adaptive codebook 141 , 
the index C indicating the pulse train selected by the 
search of the adaptive algebraic codebook 143. the 
index G indicating the gains GO. G1 selected by the 
search of the gain codebook, and the index L indicating 30 
the pitch period. 

[0049] The index A is decoded by the LPC dequan- 
tizer section 121 thereby to determine the LPC consti- 
tuting the synthesis filter information, which is input to 
the synthesis section 120. The indexes B and C are 3s 
input to the adaptive codebook 141 and the adaptive 
algebraic codebook 143, respectively. The pitch vector 
and the pulse train are output from these codebooks 
141, 143. respectively In this case, the adaptive alge- 
braic codebook 1 43 outputs a pulse train by determining 40 
the pulse positions and the signs from the index B and 
the adaptive algebraic codebook 143 formed by the 
pulse position candidate search section 142 based on 
the pitch vector input from the adaptive codebook 141. 
The pulse train output from the adaptive algebraic code- 45 
book 143 is given a periodicity of the pitch period L by 
the pitch enhancement section 160 as required. 
[0050] The pitch vector output from the adaptive code- 
book 141 and the pulse train output from the adaptive 
algebraic codebook 143 and given a periodicity by the so 
pitch enhancement section 160 as required are multi- 
plied by the gain GO for the pitch vector and the gain G1 
for the noise vector at the gain multiplier sections 102, 
103. respectively, after which they are added to each 
other at the adder section 104 and applied to the syn- 55 
thesis section 120 as an excitation signal. A recon- 
structed speech signal is output from this synthesis 
section 120. The gains GO, G1 are selected from a gain 



codebook not shown according to the index G. 
[0051 ] As described above, according to this embodi- 
ment, only the bit rate can be reduced while maintaining 
the high speech quality. So, the speech encod- 
ing/decoding of high quality can be realized with low bit 
rate. 

[0052] FIG. 6 shows a speech encoding system 
according to a second embodiment of the invention. 
This speech encoding system has a configuration simi- 
lar to the configuration of the first embodiment shown in 
FIG. 1, except that in the present embodiment, the 
pulse position candidate search section 142 and the 
adaptive algebraic codebook 143 are not included, and 
the adaptive algebraic codebook 143 is replaced by an 
ordinary stochastic codebook 144 and further a pulse 
shaping fitter analyzer section 161 and a pulse shaping 
section 162 are added thereto. 

[0053] Now, the steps of processing according to this 
embodiment will be explained. The input speech signal 
is subjected to the LPC analysis and LPC quantization, 
followed by the search of the adaptive codebook 141 in 
the same steps as in the first embodiment. The stochas- 
tic codebook 144 is configured of an algebraic code- 
book, for example, in this embodiment. 
[0054] The pulse shaping filter analyzer section 161 
determines and outputs the parameter of the pulse 
shaping section 162 which normally consists of a digital 
filter, based on the pitch vector determined by searching 
the adaptive codebook 141 . The pulse shaping section 
162 filters the output of the stochastic codebook 144 
and outputs a shaped noise vector. 
[0055] As in the first embodiment, the noise vector is 
given a periodicity using the pitch enhancement section 
160 as required. The gains GO. G1 for the pitch vector 
and the noise vector are determined and an index is 
output. The parameters of the pulse shaping section 
162 are determined from the pitch vector, and therefore 
the addition of new information is not required. 
[0056] The feature of this embodiment resides in that 
the pulse shaping section 162 is set based on the wave- 
form of the pitch vector thereby to shape the pulse train 
output from the stochastic codebook 144 including an 
algebraic codebook. As described with reference to the 
first embodiment, the low rate encoding reduces the 
number of pulse positions and pulses and thus deterio- 
rates the sound quality conspicuously. A reduced 
number of pulses causes a conspicuous pulse-like 
noise in the decoded speech. The use of the pulse 
shaping section 162 as in the present embodiment, 
however, remarkably alleviates the pulse-like noise. 
[0057] Various methods are available for designing 
the pulse shaping section 162. A first example is to uti- 
lize the phenomenon that the excitation signal for excit- 
ing the synthesis filter, if phase-equalized, becomes a 
pulse-like signal. In the case where a phase equaliza- 
tion inverse fitter is used, therefore, a waveform similar 
to the ideal excitation signal is produced from a pulse- 
like signal input. The disadvantage of the conventional 
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method of using a pulse waveform lies in that the phase 
information otherwise contained in the ideal excitation 
signal is lacking. The decreased number of pulses 
makes this problem conspicuous. In view of this, as in 
this example, the phase information is added to the 5 
pulse shaping section 162, thereby making it possible to 
generate a waveform similar to the ideal excitation sig- 
nal from a pulse waveform. 

[0058] In this first example, the information on the filter 
coefficient of the phase equalization inverse filter is io 
required to be transmitted, and the bit rate is increased 
correspondingly. Thus, a second example method con- 
ceivable is to employ a pulse shaping section 162 using 
a pitch vector as an approximation of the phase informa- 
tion. In a voiced section or the like, the pitch vector is is 
similar in shape to the excitation signal and therefore 
the phase information can be extracted. 
[0059] As a specific example method, a pulse shaping 
filter can be used, in which synchronized points such as 
peak points of the pitch vector are determined and a 20 
waveform of several samples is extracted from the par- 
ticular synchronized point as an impulse response of 
the pulse shaping filter. The effective length of the wave- 
form thus extracted is about 2 to 3 samples. It is also 
effective to ''window" and thereby attenuate the 25 
extracted samples before use. Another advantage is 
that since the same pitch vector is produced on both the 
decoding and encoding sides, a new transmission bit is 
not required. At the time of searching the stochastic 
codebook 144, the pulse shaping section 162 remains 30 
in constant operation. By calculating the impulse 
response together with that of the synthesis section 1 20 
in advance, therefore, the calculation amount can be 
reduced. 

[0060] FIG. 7 shows a speech decoding system cor- 35 
responding to the speech encoding system of FIG. 6. 
The component parts having the same functions as the 
corresponding component parts in FIG. 6 are desig- 
nated by the same reference numerals, respectively. 
The speech decoding system of FIG. 7 includes the 40 
synthesis section 120, a LPC dequantizer section 121, 
an adaptive codebook 141 , a stochastic codebook 144, 
a pulse shaping filter analyzer section 161, a pulse 
shaping section 162, a pitch enhancement section 160, 
gain multiplier sections 102, 103 and an adder section 45 
104. This system is supplied with an encoded stream 
transmitted from the speech encoding system of FIG. 6. 
[0061 ] The encoded stream is input to a demultiplexer 
section not shown, which produces an output in divided 
forms including an index A of the synthesis filter irtfor- so 
mation described above, an index B indicating the pitch 
vector selected by the search of the adaptive codebook 
141, an index C indicating the pulse train selected by 
the search of the stochastic codebook 144, and an 
index G indicating the gains GO. G1 selected by the ss 
search of the gain codebook. The pitch period L is cal- 
culated by the index B. 

[0062] The index A is decoded by the LPC dequan- 



tizer section 121 into the synthesis filter information and 
input to the synthesis section 1 20. The indexes B and C 
are input to the adaptive codebook 141 and the sto- 
chastic codebook 144, respectively, from which a pitch 
vector and a pulse train are output. 
[0063] In this case, the pulse train output from the sto- 
chastic codebook 1 44 is filtered through the pulse shap- 
ing section 162 with the filter coefficient thereof set by 
the pulse shaping filter analyzer section 161 based on 
the pitch vector determined by the search of the adap- 
tive codebook 141, and then given a periodicity of the 
pitch period L by the pitch enhancement section 160 as 
required. 

[0064] The pitch vector output from the adaptive code- 
book 141 and the pulse train output from the stochastic 
codebook 144 and modified by the pulse shaping sec- 
tion 162 and the pitch enhancement section 160 are 
multiplied by the gain GO for the pitch vector and by the 
gain G1 for the noise vector at the gain multiplier sec- 
tions 102, 103, respectively. The resulting signals are 
added to each other, input to the synthesis section 120 
as an excitation signal, and from the synthesis section 
120, output as a synthesized decoded speech signal. 
The gains GO, Gl are selected from the gain codebook 
not shown according to the index G. 
[0065] In this way, according to this embodiment the 
pulse shaping section 162 is used. Even in the case 
where an algebraic codebook with a reduced number of 
pulses due to the low rate encoding is used as the sto- 
chastic codebook 1 44, therefore, only the bit rate can be 
effectively reduced while maintaining the sound quality 
of the decoded speech. 

[0066] FIG. 8 shows a speech encoding system 
according to a third embodiment of the invention. This 
speech encoding system has such a configuration that 
the pulse shaping filter analyzer section 161 and the 
pulse shaping section 162 described with reference to 
the second embodiment are added to the configuration 
of the first embodiment. 

[0067] Now, the steps of processing according to this 
embodiment will be explained. Like in the first embodi- 
ment, the first step to be executed is the LPC analysis 
and the LPC quantization. After complete search of the 
adaptive codebook 1 41 , a pitch vector is delivered to the 
pulse position candidate search section 142 and the 
pulse shaping filter analyzer section 161. The pulse 
position candidate search section 142 determines pulse 
position candidates by the method described with refer- 
ence to the first embodiment and produces an adaptive 
algebraic codebook 143. The pulse shaping filter ana- 
lyzer section 161 determines the parameters of the 
pulse shaping section 162 as described with reference 
to the second embodiment. The parameters are nor- 
mally the filter coefficients and the pulse shaping sec- 
tion normally consists of a digital filter. 
[0068] In the search of the adaptive algebraic code- 
book 143, the pulse train output is shaped by the pulse 
shaping section 162. In actual search, the impulse 
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response of the pulse shaping section 162 and the pitch 
enhancement section 160 is combined with the synthe- 
sis section 120, and therefore the calculation amount is 
reduced. 

[0069] FIG. 9 shows a speech decoding system cor- s 
responding to the speech encoding system of FIG. 8. 
The operation of this speech decoding system is obvi- 
ous from the operation of the speech decoding system 
described with reference to the first and second embod- 
iments. Therefore, the same component parts as the io 
corresponding ones in FIGS. 1, 7 and 8 are designated 
by the same reference numerals, respectively, and will 
not be described in detail. 

[0070] As described above, this embodiment uses the 
pulse position candidate search section 142 and the 75 
adaptive algebraic codebook 143 described with refer- 
ence to the first embodiment and the pulse shaping filter 
analyzer section 161 and the pulse shaping section 152 
described with reference to the second embodiment at 
the same time. Even in the case where a few number of 20 
pulses are selected from the limited position candi- 
dates, therefore, a high sound quality can be main- 
tained, and a speech encoding system of high sound 
quality and low bit rate can be realized. 
[0071] FIG. 10 shows a block diagram of a speech 2s 
encoding system according to a fourth embodiment of 
the invention. This speech encoding system has the 
same configuration as the system of the first embodi- 
ment except that the pulse position candidate search 
section in the first embodiment includes a pitch vector 30 
smoothing section 171, a position candidate density 
function calculation section 172 and a position candi- 
date calculation section 173. 

[0072] The processing steps of this embodiment will 
be explained. As in the first embodiment, the first step is 35 
the LPC analysis and the LPC quantization. Upon com- 
plete search of the adaptive codebook 141, the pitch 
vector is delivered to the pitch vector smoothing section 
1 71 of the pulse position candidate search section 142. 
The pitch vector smoothing section 171 subjects the 40 
pitch vector to the processing of steps S1 to S2 in the 
flowchart of FIG. 2, for example, and determines and 
outputs a power envelope of the pitch vector. In the 
position candidate density function calculation section 
172. the power envelope is output by being converted 45 
into the position candidate density function. The posi- 
tion candidate calculation section 173 calculates pulse 
position candidates using this position candidate den- 
sity function instead of the power envelope, and accord- 
ing to the pulse position candidates thus obtained, so 
produces an adaptive algebraic codebook 143. Subse- 
quent process is the same as that of the first embodi- 
ment. 

[0073] The feature of this embodiment lies in the 
method of processing in the pulse position candidate 55 
search section 142. According to the first embodiment, 
the power envelope of the pitch vector is used directly 
for adaptation of the pulse position candidates. In the 



present embodiment, in contrast, the power envelope is 
used for adaptation after being converted into the posi- 
tion candidate density function. This will be explained in 
detail with reference to FIGS. 11A to 11C. FIG. 11A 
shows the power envelope of the pitch vector output 
from the pitch vector smoothing section 1 71 . In the posi- 
tion candidate density function calculation section 172, 
the position candidate density function (FIG. 11B) is 
generated from the power envelope of the pitch vector 
(FIG. 11 A). In the process, the conversion is effected 
using a function f indicating the correspondence 
between the value (x) of the power envelope and the 
value f(x) of the position candidate density function 
shown in FIG. 11C. An example method of generating 
the function f is by determining it in advance statistically 
by processing a great number of learned speeches. 
Also, the table data can be used instead of the function. 
[0074] The same pulse position candidate search sec- 
tion 142 including the function f for conversion is pro- 
vided for the encoder and the decoder. Therefore, there 
is no need of sending information on the adaptation, 
and the bit rate is not increased as compared with the 
case in which no adaptation is performed. 
[0075] FIG. 12 shows a configuration of a speech 
encoding system according to this embodiment corre- 
sponding to the speech encoding system of FIG. 10. 
The operation of this speech encoding system is obvi- 
ous from the operation of the speech encoding system 
explained in the first to third embodiments, and will not 
be explained in detail. 

[0076] As described above, according to this embodi- 
ment the value of the power envelope of the pitch vec- 
tor and the density of the pulse position candidates are 
converted using the function f, and therefore the 
processing steps become somewhat complicated as 
compared with the first embodiment. Nevertheless, the 
position candidates can be distributed more accurately. 
Also, the first embodiment can be regarded as the same 
case as the one in which x = f(x) in this embodiment 
[0077] FIG. 13 shows a block diagram of a speech 
encoding system according to a fifth embodiment of the 
invention. This speech encoding system has the same 
configuration as the first embodiment except that the 
pulse position candidate search section of the first 
embodiment includes the pitch filter inverse calculation 
section 174, the smoothing section 1 75 and the position 
candidate calculation section 173. 
[0078] Now, the processing steps of this embodiment 
will be explained. As in the first embodiment, the first 
step is the LPC analysis and the LPC quantization. After 
complete search of the adaptive codebook 141, the 
pitch vector is delivered to the pitch filter inverse calcu- 
lation section 1 74 of the pulse position candidate search 
section 142. The pitch filter inverse calculation section 
174 makes a calculation for expressing the inverse 
characteristic of the pitch enhancement section 160. 
Assume, for example, that the transfer function P(z) of 
the pitch filter is given as 
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P(z)=1-az A (-L) (1) 

The pitch filter inverse calculation section 1 74 can use a 
filter with the transfer function Q(z) given as 

5 

Q(z) = 1/(1 -baz*(-L)) (2) 

where a is a constant, b the degree of inverse character- 
istic, and when b = 1 , Q(z) becomes an inverse filter of 
P(z). The input pitch vector is output after being w 
inversely calculated, and the smoothing section 175 
determines the power envelope in the same manner as 
the pitch vector smoothing section 171 of the fourth 
embodiment. In the position candidate calculation sec- 
tion 173, the pulse position candidates are selected is 
according to this power envelope and the adaptive alge- 
braic codebook 143 is produced. Subsequent proc- 
esses are similar to those of the first embodiment. 
[0079] The feature of this embodiment lies in that the 
pitch vector taking the effect of the pitch enhancement 20 
section 160 into account is used for adaptation of the 
pulse position candidates. By doing so, the efficiency is 
improved for the reason described below. The noise 
vector generated from the adaptive algebraic codebook 
is given a periodicity by the pitch enhancement section 25 
160. In the case where equation 1 is used for giving a 
periodicity, the pulses in the neighborhood of the head 
of the subframe are repeated many times within the 
subframe at pitch period intervals, while the pulses in 
the last half nearer to the tail are repeated to lesser 30 
degree. Observation of the noise code vector actually 
obtained shows that the stronger the pitch filter used, 
the higher the tendency of the pulses nearer to the head 
to rise. This indicates that the pulse position depends 
not only on the shape of the pitch vector but also on the 35 
pitch filter. According to this embodiment, the pitch filter 
inverse calculation section 174 is used to realize the 
adaptation of the pulse position candidates taking the 
effect of the pitch enhancement section 160 into consid- 
eration. 40 
[0080] According to the third embodiment, the noise 
vector is applied through two different types of filters 
including a pulse shaping fitter and a pitch filter. When 
applying the present embodiment in such a case, ide- 
ally, the characteristic of the two filters combined is 45 
determined, and the inverse characteristic of the char- 
acteristic is used for the pitch filter inverse calculation 
section. To avoid the increase in the processing amount, 
however, the use of only the characteristic of the pitch 
filter having a larger effect is also effective. Also, the so 
pitch filter inverse calculation section 174 and the 
smoothing section 175 can be reversed in order. 
[0081] FIG. 14 shows a configuration of a speech 
decoding system according to this embodiment corre- 
sponding to the speech encoding system of FIG. 13. ss 
The operation of this speech encoding system is obvi- 
ous from the operation of the speech decoding system 
described in the first to fourth embodiments and there- 



fore will not be described in detail. 
[0082] FIG. 15 is a block diagram showing a speech 
encoding system according to a sixth embodiment of 
the invention. The configuration of this speech encoding 
system is the same as that of the first embodiment 
except that the adaptive algebraic codebook according 
to the first embodiment is replaced by the noise vector 
generating section 180 and the amplitude codebook 
181. 

[0083] Now. the processing steps according to this 
embodiment will be explained. Like in the first embodi- 
ment, the first step is the LPC analysis and the LPC 
quantization, and upon complete search of the adaptive 
codebook 141 , the pitch vector is delivered to the pulse 
position search section 1 74. In the pulse position search 
section 174, the pulse positions are determined based 
on the power envelope of the pitch vector by the same 
method as in the first embodiment, and are output to the 
noise vector generating section. This embodiment is dif- 
ferent from the foregoing embodiments in that pulses 
are set by the noise vector search section at all the posi- 
tions determined by the pulse position search section 
174. Specifically, in the foregoing embodiments, the 
pulse position candidates are determined and the opti- 
mum pulse positions are selected by the adaptive alge- 
braic codebook. According to this embodiment, in 
contrast, all the pulse position candidates are used at 
the same time. Therefore, the processing for selecting 
the pulse positions is eliminated. Instead, the process- 
ing is added for selecting the amplitude of each pulse 
from the amplitude codebook 181. Also, the information 
D representing the pulse amplitude is output in place of 
the information c indicating the pulse positions. 
[0084] A method of generating a noise vector will be 
described in detail with reference to FIG. 16. The ampli- 
tude pattern obtained from the amplitude codebook is 
shown by arrow in the graph (a) of FIG. 16. This case 
assumes that seven pulses are raised. The waveforms 
(b) and (c) of FIG. 16 represent the pitch vector power 
envelope obtained at the pulse position search section 
174 and the corresponding pulse positions (indicated by 
circles in the diagram). In the waveform (b) of FIG. 16, 
the power has two high portions so that seven pulse 
positions are distributed to two positions. In the wave- 
form (c) of FIG. 16. in contrast, only one high portion 
exists at the center, at which the pulse positions are 
concentrated. The graphs (d) and (e) of FIG. 16 show 
noise vectors obtained by setting the amplitude pulses 
(a) of FIG. 1 6 at the respective pulse positions. It is seen 
that the shape of the excitation signal changes with the 
pitch vector power envelope. As already described, the 
information on the power envelope of the pitch vector is 
not required to be transmitted. According to this embod- 
iment therefore, the noise vector can be formed in an 
almost ideal shape without increasing the bit rate. 
[0085] In this embodiment, the higher the bit rate, the 
more pulse amplitude information D can be sent with an 
increasingly improved quality. Nevertheless, the degree 



9 



17 



EP 0 926 660 A2 



18 



of improvement progressively decreases. With a certain 
high bit rate, the performance may be improved more by 
including the noise vectors in the search candidates 
with pulses set at positions not selected than by 
increasing the amplitude information. Specifically, the 5 2. 
pulse position search section 174 outputs different 
pulse position patterns (pulse patterns), and the noise 
vector generating section searches the amplitude for 
each pulse pattern. A pulse pattern generated from the 
pulse positions not selected is produced in addition to 10 
the above-mentioned pulse pattern adapted to the pitch 
vector. A method can be cited, for example, in which all 
the sample positions of the subframe less the sample 
positions selected by adaptation are used as a second 
pulse pattern, so that the amplitude search is carried 75 3. 
out for the two pulse patterns. The number of bits allo- 
cated to the amplitude information can be varied from 
one pulse pattern to another. Normally, however, it is 
more efficient to allocate more bits to the pulse pattern 
that has used the adaptation. In the case of using a plu- 20 
rality of pulse patterns, it is necessary to include in the 
information D the information as to which pulse pattern 
is used. The amplitude information correspondingly 
decreases. However, the quality is higher than when 
searching only one pulse pattern. & 
[0086] FIG. 17 shows a configuration of a speech 
decoding system according to this embodiment corre- 
sponding to the speech encoding system of FIG. 15. 
The operation of this speech decoding system is obvi- 4. 
ous from the operation of the speech decoding system 30 
described in the first to fifth embodiments, and therefore 
will not be described in detail. 

[0087] Although a speech encoding/decoding method 
is described above with reference to embodiments, the 
present invention is also applicable to a speech synthe- 35 
sis method. In such a case, in the speech decoding sys- 
tem shown in FIGS. 5, 7 and 9. each index is 
determined based on a reconstructed speech signal to 
be synthesized. 

[0088] It will thus be understood from the foregoing 40 
description that according to this invention, a speech 
encoding/decoding operation of high sound quality can 
be performed even when using a pulse codebook with a 
decreased number of pulse positions and pulses due to 
the low rate encoding. 45 

Claims 

1. A speech encoding method characterized by com- 
prising the steps of: so 5. 

generating information representing character- 
istics of a synthesis filter; and 
generating an excitation signal for exciting said 
synthesis filter, the excitation signal including a 55 
pulse train generated by setting one or more 
pulses at a predetermined number of pulse 
positions selected from a plurality of pulse posi- 



tion candidates adaptively changed in accord- 
ance with the characteristics of said speech 
signal. 

A speech encoding method according to claim 1, 
characterized in that said excitation signal generat- 
ing step is for generating an excitation signal con- 
taining a pulse train generated by setting one or 
more pulses at a predetermined number of pulse 
positions selected from a plurality of pulse position 
candidates arranged so that pulse position candi- 
dates exist in a greater number at positions of larger 
power of said speech signal. 

A speech encoding method characterized by com- 
prising the steps of: 

generating information representing character- 
istics of a synthesis filter; and 
generating an excitation signal including a 
pulse train generated by setting one or more 
pulses at a predetermined number of pulse 
positions selected from a plurality of pulse posi- 
tion candidates adaptively changed in accord- 
ance with the property of said speech signal, 
an amplitude of each of the pulses being opti- 
mized by a predetermined means. 

A speech encoding method characterized by com- 
prising the steps of: 

generating information representing character- 
istics of a synthesis filter; and 
generating an excitation signal containing 
either one of a first pulse train and a second 
pulse train for exciting said synthesis fitter, the 
first pulse train being generated by setting one 
or more pulses at a predetermined number of 
pulse positions selected from a plurality of first 
pulse position candidates adaptively changed 
in accordance with characteristics of said 
speech signal, and the second pulse train 
being generated by setting one or more pulses 
at a predetermined number of pulse positions 
selected from a plurality of second pulse posi- 
tion candidates containing a part or all of posi- 
tions not used as the first pulse position 
candidates. 

A speech encoding method characterized by com- 
prising the steps of: 

generating information representing character- 
istics of a synthesis filter; and 
generating an excitation signal containing a 
pitch vector and a noise vector for exciting said 
synthesis filter, the excitation signal including a 
pulse train generated by setting one or more 
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pulses at a predetermined number of pulse 
positions selected from a plurality of pulse posi- 
tion candidates changed in accordance with 
the shape of said pitch vector. 

5 

A speech encoding method characterized by com- 
prising the steps of: 

generating at least information representing 
characteristics of a synthesis filter; and 10 
generating an excitation signal containing a 
pitch vector and a noise vector for exciting said 
synthesis filter and including a pulse train gen- 
erated by setting one or more pulses at a pre- 
determined number of pulse positions selected is 
from a plurality of pulse position candidates 
arranged so that pulse position candidates 
exist in a greater number at positions of larger 
power of said pitch vector. 



generating an excitation signal containing a 
pulse train shaped by a pulse shaping method 
having a characteristic determined based on 
the shape of said pitch vector. 

10. A speech encoding method characterized by com- 
prising the steps of: 



generating at least information representing 
the characteristic of a synthesis filter for a 
speech signal; and 

generating an excitation signal containing a 
pitch vector for exciting said synthesis filter and 
a noise vector including a pulse train generated 
by setting one or more pulses at a predeter- 
mined number of pulse positions selected from 
a plurality of pulse position candidates 
arranged to exist in a greater number at posi- 
tions of larger power of said pitch vector, said 
pulse train being shaped by a pulse shaping 
method having a characteristic determined 
based on the shape of said pitch vector. 



20 

A speech encoding method characterized by com- 
prising the steps of: 



generating information representing character- 
istics of a synthesis filter; and 25 
generating an excitation signal containing a 
pitch vector and a noise vector for exciting said 
synthesis filter, the noise vector including a 
pulse train generated by setting one or more 
pulses at a predetermined number of pulse 30 
positions selected from a plurality of position 
candidates set based on a pulse position can- 
didate density function obtained from the 
shape of said pitch vector. 

35 

A speech encoding method characterized by com- 
prising the steps of: 

generating information representing character- 
istics of a synthesis filter; and 40 
generating an excitation signal containing a 
pitch vector and a noise vector having a shape 
processed by a compensation filter for exciting 
said synthesis filter, the noise vector including 
a pulse train generated by setting one or more 45 
pulses at a predetermined number of pulse 
positions selected from a plurality of pulse posi- 
tion candidates changed in accordance with a 
shape of an inverse compensation pitch vector 
obtained by subjecting a computation based on so 
inverse characteristics of the compensation fil- 
ter to the pitch vector. 

A speech encoding method characterized by com- 
prising the steps of: 55 

generating at least information representing 
characteristics of a synthesis filter; and 



11- A speech decoding method characterized by com- 
prising the steps of: 

receiving an encoded stream containing infor- 
mation relative to a pulse train generated by 
setting one or more pulses at a predetermined 
number of pulse positions selected from a plu- 
rality of pulse position candidates adaptively 
changed in accordance with the character of 
said speech signal; and 
inputting the encoded stream to a synthesis fil- 
ter for reconstructing a speech signal. 

12. A speech decoding method characterized by com- 
prising the steps of: 

receiving an encoded stream containing a 
pulse train generated by setting one or more 
pulses at a predetermined number of pulse 
positions selected from a plurality of pulse posi- 
tion candidates arranged to exist in a greater 
number at positions of larger power of said 
speech signal; 

making an excitation signal including the pulse 
train; and 

inputting the excitation signal to a synthesis fil- 
ter for reconstructing a speech signal. 

13. A speech decoding method characterized by com- 
prising the steps of: 

receiving an excitation signal containing a 
pulse train generated by setting one or more 
pulses having a give amplitude at a plurality of 
pulse positions adaptively changed in accord- 
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ance with the character of said speech signal; 
and 

inputting the excitation signal to a synthesis fil- 
ter for reconstructing a speech signal. 

14. A speech decoding method characterized by com- 
prising the steps of: 

receiving an excitation signal containing one of 
a first pulse train and a second pulse train, the 
first pulse train being generated by setting one 
or more pulses at a predetermined number of 
pulse positions selected from a plurality of first 
pulse position candidates adaptively changed 
in accordance with the character of said 
speech signal, and the second pulse train 
being generated by setting one or more pulses 
at a predetermined number of pulse positions 
selected from a plurality of second pulse posi- 
tion candidates including a part or all of the 
positions not used as the first pulse position 
candidates; and 

inputting the excitation signal to a synthesis fil- 
ter for reconstructing a speech signal. 

15. A speech decoding method characterized by com- 
prising the steps of: 

receiving an encoded stream including a pitch 
vector and a noise vector containing a pulse 
train generated by setting one or more pulses 
at a predetermined number of pulse positions 
selected from a plurality of pulse position can- 
didates changed in accordance with the shape 
of said pitch vector; 

making an excitation signal including the pulse 
train; and 

inputting the excitation signal to a synthesis fil- 
ter for reconstructing a speech signal. 

16. A speech decoding method characterized by com- 
prising the steps of: 



receiving an encoded stream including infor- 
mation relative to a pitch vector and a noise 45 
vector containing a pulse train generated by 
setting one or more pulses at a predetermined 
number of pulse positions selected from a plu- 
rality of pulse position candidates arranged to 
exist in a greater number at positions of larger so 
power of the pitch vector; 
making an excitation signal including the pulse 
train; and 

inputting the excitation signal to a synthesis fil- 
ter for reconstructing a speech signal. ss 

17. A speech decoding method characterized by com- 
prising the steps of: 



so 



25 



30 



35 



40 



receiving an excitation signal containing a pitch 
vector and a noise vector for exciting said syn- 
thesis filter, the noise vector including a pulse 
train generated by setting one or more pulses 
at a predetermined number of pulse positions 
selected from a plurality of position candidates 
set based on a pulse position candidate density 
function obtained from the shape of said pitch 
vector; and 

inputting the excitation signal to a synthesis fil- 
ter for reconstructing a speech signal. 

18. A speech decoding method characterized by com- 
prising the steps of: 

receiving an excitation signal formed of a pitch 
vector and a noise vector having a shape proc- 
essed by a compensation filter, the noise vec- 
tor including a pulse train generated by setting 
one or more pulses at a predetermined number 
of pulse positions selected from a plurality of 
pulse position candidates changed in accord- 
ance with a shape of an inverse compensation 
pitch vector obtained by subjecting a computa- 
tion based on inverse characteristics of the 
compensation filter to the pitch vector; and 
inputting the excitation signal to a synthesis fil- 
ter for reconstructing a speech signal. 

19. A speech decoding method characterized by com- 
prising the steps of: 

receiving an excitation signal formed of a pitch 
vector and a noise vector, the noise vector con- 
taining a pulse train shaped by a pulse shaping 
method having a characteristic determined 
based on the shape of said pitch vector; and 
inputting the excitation signal to a synthesis fil- 
ter for reconstructing a speech signal. 

20. A speech decoding method characterized by com- 
prising the steps of: 

receiving an excitation signal formed of a pitch 
vector and a noise vector, the noise vector con- 
taining a pulse train generated by setting one 
or more pulses at a predetermined number of 
pulse positions selected from a plurality of 
pulse position candidates arranged to exist in a 
greater number at positions of larger power of 
said pitch vector, the pulse train being shaped 
by a pulse shaping method having a character- 
istic determined based on the shape of said 
pitch vector; and 

inputting the excitation signal to a synthesis fil- 
ter for reconstructing a speech signal. 
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